173 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English French Portuguese Spanish
Availability:
Freely Available
License:
Size:
300 OtherProduction Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:MEDLINE as a Parallel Corpus: a Survey to Gain Insight on French-, Spanish- and Portuguese-speaking Authors’ Abstract Writing Practice
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Aurélie Névéol | MEDLINE parallel corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English French Portuguese
Availability:
Freely Available
License:
Apache-2.0
Size:
31403 translation units OtherProduction Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:A Post-Editing Dataset in the Legal Domain: Do we Underestimate Neural Machine Translation Quality?
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Julia Ive | Post-Editing Dataset in the Legal Domain | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Portuguese
Availability:
Available, but there's a public domain part and a restricted part.
License:
CC-BY (public domain part) and CC-BY-NC-ND (restricted part)
Size:
3945943 words Production Status:
Newly created-in progress
Use:
authorship detection, genre classification, comparative literature, diachronic linguistics
-
Paper title:The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | João Ricardo Silva | BDCamões | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Portuguese
Availability:
Available, but there's a public domain part and a restricted part.
License:
CC-BY (public domain part) and CC-BY-NC-ND (restricted part)
Size:
4495379 tokens Production Status:
Newly created-in progress
Use:
parsing, information extraction, authorship detection, genre classification, comparative literature, diachronic linguistics
-
Paper title:The BDCamões Collection of Portuguese Literary Documents: a Research Resource for Digital Humanities and Language Technology
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | João Ricardo Silva | BDCamões Treebank | /N |
Documentation:
None
Written
Ontology,
Language Type:
Monolingual
Languages:
Portuguese
Availability:
Freely Available
License:
CC-BY-ND
Size:
41240 synsets Production Status:
Newly created-in progress
Use:
word sense disambiguation, knowledge representation
-
Paper title:The MWN.PT WordNet for Portuguese: Projection, Validation, Cross-lingual Alignment and Distribution
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | João Ricardo Silva | MWN.PT | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Azerbaijani Belarusian Bulgarian Catalan Danish English Estonian Filipino Finnish Hindi Hungarian Indonesian Irish Italian Japanese Kazakh Korean Latvian Lithuanian Mongolian Norwegian Polish Portuguese Russian Serbian (Latin) Slovenian Spanish Swedish Tamil Turkish Ukrainian Urdu Uzbek Vietnamese ces deu ell fas fra isl kat mkd nld ron slk sqi zho
Availability:
Freely Available
License:
GNU-GPL v.3
Size:
45 billion words Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Geographically-Balanced Gigaword Corpora for 50 Language Varieties
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jonathan Dunn | GeoWAC | /N |
Documentation:
https://github.com/jonathandunn/earthlings
Written
Corpus,
Language Type:
Monolingual
Languages:
Portuguese
Availability:
Freely Available
License:
Size:
360 KByte Production Status:
Newly created-in progress
Use:
Question Answering
-
Paper title:AIA-BDE: A Corpus of FAQs in Portuguese and their Variations
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hugo Gonçalo Oliveira | AIA-BDE | /N |
Documentation:
None
Written
Tokenizer,
Language Type:
Monolingual
Languages:
Portuguese
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:AIA-BDE: A Corpus of FAQs in Portuguese and their Variations
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hugo Gonçalo Oliveira | NLPyPort | /N |
Documentation:
None
Written
Grammar/Language Model,
Language Type:
Monolingual
Languages:
Portuguese
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Textual Entailment and Paraphrasing
-
Paper title:AIA-BDE: A Corpus of FAQs in Portuguese and their Variations
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hugo Gonçalo Oliveira | AllenNLP ELMo model for Portuguese | /N |
Documentation:
None
Written
Grammar/Language Model,
Language Type:
Monolingual
Languages:
Portuguese
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Textual Entailment and Paraphrasing
-
Paper title:AIA-BDE: A Corpus of FAQs in Portuguese and their Variations
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hugo Gonçalo Oliveira | FastText.cc | /N |
Documentation:
None




